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(54) Digital multimedia watermarking for source identification 



(57) A system and method for communicating a de- 
vice's capabilities uses a digital watermark embedded 
in content data. The watermark includes parameters 
concerning a source unit's communications capabilities. 
The watermark is embedded within content data, such 
as multimedia data, of a data packet. A destination unit, 



upon receiving the data packet detects if a watermark 
is present, and if so extracts the source's capability pa- 
rameters from the watermark. The destination unit then 
negotiates with the source unit to use certain capabilities 
based on the source capability information contained in 
the watermark. 
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Description 

BACKGROUND OF THE INVENTION 
Field of the Invention 



[0001 J The invention relates to communicating infor- 
mation using digital watermarks. 

Description of the Related Art 

[0002] The technique of marking paper with a water- 
mark for identification Is as almost as old as papermak- 
ing itself. With the advent of digital media there are many 
techniques for applying this ancient art to our new tech- 
nologies. Such a watermark, applied to digital media, is 
referred to as a digital watermark. A digital watermark 
is described by M. Miller et al., A Review of Watermark- 
ing Principles and Practices, in Digital Signal Process- 
ing in Multimedia Systems , 18, 461-85 (K.K. Parhi and 
T. Nishitani, Marcell Dekker, Inc., 1999), as a piece of 
information that is hidden directly in media content, in 
such a way that it is imperceptible to a human observer, 
but easily detected by a computer 
[0003] Although the applications of using digital wa- 
termarks differ from the applications of using paper wa- 
termarks, the underlying purpose and approach remain 
the same. The conventional purpose of using a digital 
watermark is to identify the original document, identify 
a legitimate document or prohibit unauthorized duplica- 
tion. The approach used in applying a digital watermark 
is much the same as in watermarking of paper. Instead 
of using mesh to produce a faint indentation in the paper 
providing a unique identifiable mark, a digital pattern Is 
placed into an unused area or unnoticeable area of the 
image, audio file or file header. Digital watermarks have 
also been used to send information concerning the mes- 
sage in which the watermark is embedded, including in- 
formation for suppressing errors in signal transmission 
and calibration information. These conventional uses of 
watermarks, however, fail to fully integrate an intelligent 
watermark capable of supplying information along with 
the original signal, image or packet of data. 
[0004] More specifically, prior digital watermarking 
techniques suffer from the shortcoming of relying on 
source Identification being tied Into the network trans- 
port protocol. Conventional source identification and au- 
thentication schemes use a source identification field 
embedded in a packet header by the network or trans- 
port layer for ascertaining source origin and authentica- 
tion. Since network and transport layer headers get 
stripped off of a packet by those layers, these schemes 
limit identification and authentication to being performed 
by the data transport or network protocols. 
[0005] Current fallback schemes to support the capa- 
bilities of, and to be compatible with, preexisting and de- 
ployed equipment, referred to here as legacy equip- 
ment, also inhibit the growth and deployment of new and 
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advanced features in multimedia voice, video and data 
equipment. For example, in the military radio environ- 
ment LPC-10e vocoders (voice operated recorder), 
which use linear prediction compression (LPC), have 
5 been in use for years and are widely deployed. However, 
performance of the LPC-10e vocoder is inadequate in 
severely degraded background noise environments. 
Newer technologies, such as in the Federal Standard 
MELP (mixed excitement linear processing) vocoder, 

10 have significantly reduced background noise effects. 
The capabilities of such newer technologies often go un- 
used because currently deployed fallback mechanisms 
default to the lowest common denominator capabilities 
of the communicating devices. That is, these fallback 

is schemes reduce the operating capability of the de- 
ployed equipment to the lowest capability level of the 
devices in communication (e.g. to the capabilities of the 
legacy l_PC_10e vocoder). Accordingly, the advanced 
capabilities of new equipment goes underutilized until 

20 all legacy equipment in a network is upgraded. 

[0006] Even after the legacy equipment leaves the 
network, the voice data streams present in the network 
and that were generated to be compatible with the LPC- 
10e vocoders remain in the legacy LPC-10e waveform 

25 even though such a waveform is not required for oper- 
ation once the legacy equipment is removed from the 
communication network. This is due to the fact that the 
newer radios are not capable of discriminating new 
equipment sources from legacy equipment sources 

30 purely from the transport layer information in the trans- 
mitted data stream. This prevents the new equipment 
from automatically negotiating capabilities with other 
devices to operate with the greatest capabilities com- 
mon to the communicating devices. 

35 

SUMMARY OF THE INVENTION 

[0007] Therefore, in light of the above, and for other 
reasons that will become apparent when the invention 

40 js fully described, an aspect of the invention is to auto- 
matically negotiate communication parameters be- 
tween communicating devices based on the capabilities 
of those devices. This can be accomplished by including 
capability information in a digital watermark embedded 

45 in an information field of a message transported be- 
tween the communicating devices. 
[0008] A further aspect of the invention enables a de- 
vice in a communication network to negotiate a common 
set of communication capabilities for devices in the net- 

50 work to use, without relying on information contained in 
a packet header. 

[0009] Yet another aspect of the invention generates 
a digital, watermark including Information concerning a 
device's capabilities. 
55 [001 0] A still further aspect of the invention detects a 
digital watermark in an information field of a data re- 
ceived from a source unit, extract from a watermark in 
the packet information concerning the source units ca- 
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pabilities. 

[001 1 ] The aforesaid objects are achieved individual- 
ly and in combination, and it is not intended that the in- 
vention be construed as requiring two or more of the ob- 
jects to be combined unless expressly required by the 
claims attached hereto. 

[0012] The above and still further objects, features 
and advantages of the invention will become apparent 
upon consideration of the following descriptions and de- 
scriptive figures of specific embodiments thereof. While 
these descriptions go into specific details of the inven- 
tion, it should be understood that variations may and do 
exist and would be apparent to those skilled in the art 
based on the descriptions herein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] 

Fig. 1 is a block diagram of a communication sys- 
tem using a digital watermark to convey in- 
formation concerning a source processor's 
capabilities. 

Fig. 2 is a flowchart illustrating a process for nego- 
tiating a set of communication capabilities. 

Fig. 3 is a flowchart illustrating a process for ex- 
tracting a digital watermark containing infor- 
mation concerning a source's capabilities. 

Fig. 4 is a block diagram of a wireless radio source 
unit that generates a watermarked speech 
frame that includes information concerning 
the capabilities of the radio. 

Fig. 5 is a block diagram of a wireless radio desti- 
nation unit that receives and processes a wa- 
termarked speech frame that includes infor- 
mation concerning the capabilities of the 
wireless radio source unit shown in Fig. 4. 

Fig. 6A is a diagram illustrating an operational proc- 
ess of the wireless radio source unit shown 
in Fig. 4. 

Fig. 6B is a diagram illustrating an operational proc- 
ess of the wireless radio destination unit 
shown in Fig. 5. 

DETAILED DESCRIPTION 

[0014] The invention is. described below with refer- 
ence to the above drawings, in which like reference nu- 
merals designate like components. 
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Overview 

[001 5] The invention uses a signature structure, such 
as a watermark, that provides control information con- 

5 tent embedded in application information, rather than a 
passive watermark that provides only identification in- 
formation. This technique also recognizes the need to 
be able to tailor the information content to customize an 
application at either a source or a destination. 

to [0016] A signature, containing information concerning 
attributes of a source unit is periodically embedded at 
pseudo-random points in a transmitted data stream al- 
lowing the source to watermark a signal, such as a mul- 
timedia signal. Using such a signature to watermark the 

'5 data stream allows a device at the data stream's desti- 
nation to identify and authenticate the source's capabil- 
ities. The watermark can contain, for example, source 
capability content, including a source ID, operational 
modes and capabilities of the source and application 

20 specific performance parameters. 

[0017] The watermark can be inserted after applica- 
tion layer multimedia processing has been performed, 
but prior to the network and transport layer processing. 
Accordingly, the watermark is embedded within the mul- 

25 timedia signal prior to network and transport layer head- 
ers being applied to packets of the data stream. This 
allows the watermark to be applied to the data stream 
without effecting either application-level or transport- 
level processing of the digital multimedia data stream. 

30 [0018] A signature is a group of bits that indicate a 
specific attribute of the source unit, such as a capability 
that the source unit possesses or lacks. The type of in- 
formation that can be used as a signature includes any 
source -specific attribute. For example, in a wireless 

35 communications system, one or more signatures can in- 
dicate the audio and video compression capabilities, the 
application software or operating system revision num- 
bers, the ID number of the source, the audio handset 
capabilities including the number of bits and audio fidel- 

40 ity of the source. The signature can be applied as a short 
duration digital signal, and can be placed as a water- 
mark in non-critical points in the data stream, such that 
its effects on the resulting reconstructed multimedia 
stream are Imperceptible to the human. For instance, a 

45 signature that indicates a source unit's capabilities can 
be placed in the watermark. 

[0019] In an audio application, the signature appears 
as a short duration pattern placed at non-critical bits in 
the compressed audio signal. The non-critical bits can 

so be predetermined so that the source and destination 
units both know a priori which bits in the signal are the 
non-critical bits with which the watermark is embedded 
In the signal. For example, for the MELP vocoder, the 
non-critical bits can include the least significant bits of 

55 the Multi-stage Vector Quantization of LPC coefficients, 
the Jitter Index bit, the least significant bits of the Sec- 
ond Gain Index, and the least significant bits of the Fou- 
rier Magnitude, as well as spare or unused bits. For 
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LPC-10e, the non-critical bits can include the least sig- 
nificant bits of the reflection coefficients. 
[0020] In a video application the signature appears in 
randomly placed pixel locations in either a still image or 
a full-motion video stream. For example, In a video code 5 
conforming to the H.263 standard, the watermark can 
be placed in the least significant bits of the unrestricted 
motion vectors and the Discrete Cosine Transform 
(DCT) coefficients. In still image encoding, such as in a 
JPEG image, the non-critical bits can include the least 10 
significant bits of the quantized DCT coefficients. 
[0021] The destination equipment detects the water- 
marked signature and ascertains the source's capabili- 
ties based on the information contained in the water- 
mark. This allows the destination equipment to negoti- is 
ate to higher levels of capabilities with new equipment 
that may be present in the network. 
[0022] Since the watermark is not perceptible to a hu- 
man, it does not significantly effect the performance of 
the multimedia signal to the end user, in legacy equip- 20 
ment. For example, a 20-bit signature, applied as a con- 
tent capabilities watermark to non-critical bits in a com- 
pressed LPC-10e bitstream, appears to legacy LPC- 
10e equipment as a valid LPC-10e data message. The 
legacy equipment passes the bitstream to the appropri- 25 
ate decoding/uncompression unit and reconstructs the 
audio signal. Since the signature is applied only at pe- 
riodic intervals, the end user of the legacy equipment 
will not perceive any degradation in the reconstructed 
voice signal. Upgraded, or new equipment, that uses a 30 
watermark detection process, can determine, based on 
the watermark, if the data stream is transmitted from 
new or legacy equipment, and can act accordingly to 
negotiate the highest level of capabilities possible. 
[0023] If the destination equipment determines that 35 
the data stream is sent from upgraded equipment that 
supports a higher level of capabilities than the legacy 
equipment, it can begin negotiation processing to use 
the upgraded equipment's enhanced multimedia capa- 
bilities, rather than remaining in a degraded operational 40 
mode in which both the source and destination remain 
in the previously negotiated lower capability mode. 
[0024] Since the detection and negotiation processes 
take place between the application and transport layers, 
it is transparent to the lower levels of data processing 45 
and is unaffected by further information transformation, 
including encryption, data packing, and data routing 
techniques. This means that the watermarked multime- 
dia signal can be treated by routers, relay equipment, 
etc., as any other random data stream. so 
[0025J Although the content capabilities watermark is 
described here as embedded in voice data for use with 
communicating vocoders, the invention is not limited to 
use with voice signals. Rather, it applies to other types 
of data as well In which a signature can be embedded ss 
without unduly degrading the data, at least as that data 
is perceived by a user For Instance, the content capa- 
bilities watermark can also be embedded in image data, 
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and used by devices that negotiate capabilities to com- 
municate the image data. Examples of such devices that 
lend themselves to content watermarking include cellu- 
lar telephones, pagers and other wireless devices, to in- 
dicate the status and interworking capabilities of these 
devices, personal digital assistants (PDA's), personal 
handheld audio devices such as Motion Picture Experts 
Group (MPEG) audio players, digital video disks 
(DVD's), compact disks (CD's), and other mass data 
storage devices. 

Multimedia Authentication Watermarking Architecture 

[0026] A block diagram of a multimedia authentication 
watermarking architecture is shown in Fig. 1, and in- 
cludes a source processor 1 and a destination proces- 
sor 6. The source processor 1 includes a multimedia ap- 
plication processor 2, a digital watermark signature gen- 
erator 3, a combiner 4 and a transport/network proces- 
sor 5. 

[0027] The digital watermark signature generator 3 
outputs a unique signature to the combiner 4. The mul- 
timedia application processor 2 receives a multimedia 
data stream such as a voice, video, or data stream, from 
an application program. The multimedia application 
processor 2 compresses, or otherwise transforms the 
multimedia data stream, and outputs a processed mul- 
timedia data stream to combiner 4. 
[0028] The combiner 4 embeds the digital watermark 
signal into the multimedia data stream. For example, the 
watermark signal can be logically OR'd with the masked 
data stream at appropriate fields of the data stream, 
such as at certain bit positions, depending upon the ap- 
plication. These locations are chosen so that they have 
a minimal impact on the subjective quality at the desti- 
nation of the multimedia data stream. The watermark 
can be applied on a periodic basis to facilitate the de- 
tection process and increase the probability of detection 
of the watermark. The combiner 4 then outputs a signed 
watermarked multimedia data stream to transport/net- 
work processor 5. The network/transport processor 5 
applies the necessary message headers and control 
bits, and packetizes the data stream. Accordingly, trans- 
port/network processor 5 outputs a data packet, or data 
transmission unit, with network/transport headers ap- 
plied to a data field that includes the digital watermark 
signature. 

[0029] The destination processor 6 receives the data 
transmission unit sent by source processor 1 . The des- 
tination processor includes a transport/network proces- 
sor 7, a watermark detector 8, a watermark extraction 
mask unit 9, a multimedia application processor 1 0 and 
a capabilities negotiation processor 11. The transport/ 
network processor 7 receives the data transmission 
unit, or packet, determines if It is destined for the desti- 
nation processor 6, and if so removes the transport and 
network headers and outputs a processed data unit to 
the watermark detector 8. The watermark extraction 
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mask unit 9 generates a predetermined watermark ex- 
traction mask corresponding to the watermark signature 
generated by the generator 3 In the source processor 1 , 
and outputs that mask to watermark detector 8. Water- 
mark detector 8 uses the mask to extract the water- 
marked source capabilities signature from the data unit. 
The detector 8 determines If a watermark is present in 
the data unit and if so, identifies the source by compar- 
ing a digital signature for the source with the multimedia 
bitstream watermark. If they match, that information 
along with the extracted watermark can be sent to the 
capabilities negotiation processor 11, which can track 
the occurrences of the watermark and negotiate capa- 
bilities. The watermark detector 8 outputs the data unit 
with the watermark removed to multimedia application 
processor 10 which performs any decompression or 
other transformation necessary to recover the multime- 
dia data stream. Processor 10 then outputs the repro- 
duced multimedia data stream, such as a voice, video, 
or data signal. 

[0030] The capabilities negotiation processor 11 
keeps track of the watermark occurrence history by 
maintaining a time/history record of those occurrences. 
The capabilities negotiation processor 1 1 takes appro- 
priate action to automatically negotiate the destination 
processing based on capabilities information contained 
in the watermark. The capabilities negotiation processor 
can then determine the sources capabilities, and output 
a control/status signal indicating those abilities to a ca- 
pabilities control unit to initiate the negotiation. The ca- 
pabilities processor then uses the control/status infor- 
mation received from the source to configure its capa- 
bilities to match those of the sources to the highest com- 
mon denominator. For instance, if the control/status sig- 
nal indicates that the source has the ability to accept 
MELP or LPC-10e compressed speech, the capabilities 
processor configures itself to transmit MELP speech, 
since that speech protocol is the highest common 
speech processing protocol that both the source and 
destination can understand. Other types of capabilities 
can be negotiated using the techniques described here. 
For example, source and destination processors can 
configure themselves to select a highest capability com- 
mon vocoder, common video compression techniques, 
etc. 

[0031] Also, the source processor can embed in a wa- 
termark a group of capabilities, such as vocoder type (e. 
g., MELP, LPC-Oe, etc.) and the type of handset the 
source is currently using (e.g., an H-250 handset, etc.). 
When a group of capabilities are embedded in a water- 
mark, those capabilities can be negotiated individually 
or as a group. For example, if the destination processor 
knows that a certain handset needs additional low fre- 
quency gain to increase intelligibility, it can configure its 
audio equalizer to provide the necessary gain. 
[0032] The capabilities negotiation processor 11 can 
determine when to negotiate to an Improved capability, 
or fall back to legacy operational mode based on the 



time occurrences of the watermark. For instance, if the 
capabilities negotiator has not detected any legacy 
equipment transmissions for a pre-determined length of 
time, it can switch from a legacy operational mode to an 
5 improved operational mode. If the capabilities negotia- 
tor detects a legacy message, the destination processor 
can fall back into a legacy operational mode for compat- 
ibility with older, non-upgraded equipment. 

'0 Multimedia Authentication Watermarking Process 

[0033] A process for using the multimedia authentica- 
tion watermarking system shown in Fig. 1 , is illustrated 
in Fig. 2. The multimedia application processor 2 of 

is source processor 1 receives a multimedia data stream, 
such as a voice, video or data stream from a data source 
(12), and processes that stream to compress or other- 
wise transform the data within the stream (1 3). The dig- 
ital watermark signature generator 3 generates an ap- 

20 propriate watermark signature that includes information 
concerning the capabilities of the data source (14) for 
use in negotiations. The combiner 4 combines the gen- 
erated watermark signature with the processed data 
stream output from the multimedia application proces- 

25 sor 2, and outputs a signed data stream to a transport/ 
network processors (15). Oneway of combining the wa- 
termark signature with the processed data stream is to 
logically overwrite the data stream at the appropriate bit 
positions with the watermark signature, depending upon 

30 the application. Those bit positions are chosen so that 
they have a minimal impact on the subjective quality of 
the multimedia data stream at the destination. The wa- 
termark can be applied on a periodic basis to facilitate 
the detection process and Increase the probability of de- 

35 tection of the watermark. The transport/network proces- 
sor 5 adds the appropriate network and transport layer 
headers to the signed data stream and outputs it as a 
data transmission unit, such as a data packet (16). For 
example, in a TCP/IP network environment, the appro- 

40 priate TCP/IP headers are added to the signed data 
packet to route the packet to the destination. Other types 
of networks, such as asynchronous transfer mode 
(ATM) networks, add headers to route information as 
appropriate. For instance, in a space-based network, a 

« packet might be routed using a Proximity-1 link layer 
protocol. In that case, the Proximity- 1 headers are need- 
ed to route the packets to the final destination. 
[0034] The transport/network processor 7 of the des- 
tination processor 6 receives the data transmission unit, 

so or data packet sent from source processor 1, and re- 
moves communication headers and control Information 
from the packet (1 6). The watermark detector 8 detects 
whether the received data packet Includes a watermark, 
and if so, extracts it (17). The watermark is analyzed to 

» determine from it the capabilities of the source (18). 
Based on those determined capabilities, the destination 
negotiates with the source communication capabilities 
to utilized in subsequent communications (1 9). The sub- 



5 



X } 



9 



EP 1 326 422 A2 



10 



sequent communication is commenced using the nego- 
tiated parameters negotiated between the destination 
and source (20). 

[0035] A more detailed illustration of the process for 
extracting a digital watermark containing information 
concerning a source's capabilities is shown in Fig. 3. 
Here, the data transmission unit is received at the des- 
tination processor which detects if it is addressed for the 
destination (21). Watermark generator 9 generates a 
predetermined watermark extraction mask (22). The re- 
ceived data transmission unit is analyzed to determine 
if a watermark is present, if so, the watermark is extract- 
ed (23). The watermark is examined to determine if it 
contains capability information about the source unit 
that transmitted the transmission unit (24). If the water- 
mark contains source capability information the source 
unit's capability parameters are detected from the ex- 
tracted watermark signature (25). The destination proc- 
essor then negotiates communication capabilities with 
the source processor (26) and continues with subse- 
quent communications (27). After extracting the capa- 
bility parameters from the watermark signature the wa- 
termark is removed from the processed multimedia data 
in the transmission data unit (28). The processed data 
is again processed to decompress or otherwise inverse 
transform the data to recover the multimedia data 
stream (29) and output that recovered data stream to a 
destination application. 

[0036] If the extracted watermark does not compare 
with the predetermined watermark generated by the 
destination watermark generator, then default, or fall- 
back capabilities are used for communication with the 
source processor (30). 

[0037] The watermark generated by the source can 
be either static or dynamic depending on whether the 
source capabilities change with time. For instance, prior 
to a software update, a particular source may only be 
capable of running LPC-1 Oe. After the software is up- 
dated, it might have the capability to run other vocoders, 
such as a MELP vocoder. Also, the capabilities may be 
location dependent. If the source contains a GPS receiv- 
er, or other position determination device, it can change 
ifs capabilities depending upon it's location. 
[0038] It will be understood that the digital watermark- 
ing signature generator and the watermark detector can 
be embodied in software, hardware, or a combination of 
both technologies. The capabilities negotiation proces- 
sor can keep track of the watermark occurrence history 
and take appropriate action to automatically negotiate 
the destination processing. The multimedia application 
processing converts the multimedia bitstream back to 
the audioA/oice/video domain. 

Wireless Radio System Application 

[0039] An example of an application that uses digital 
watermarks for negotiating between the source and 
destination units shown in Fig. 1 is a wireless radio sys- 



tem used in a military environment. Wireless radios in 
such a system can use earlier, or legacy, vocoders such 
as an LPC-1 Oe vocoder, or a newer, more capable voc- 
oder such as a MELP vocoder. A block diagram of a 
5 wireless radio source unit 30 is shown in Fig. 4. The 
source radio includes a speech generation unit 31 that 
converts raw speech into a sampled signal that is sam- 
pled, for example, at a sampling interval of 22.5 millisec- 
onds. The sampled raw speech is input to a speech 
'O compression unit 32 that compresses the speech ac- 
cording to either the MELP or the LPC-1 Oe standards. 
The compressed speech is output and supplied to a first 
buffer 33 that distinguishes the critical bits 33b in the 
compressed speech from the non-critical bits 33a. The 
is compressed speech frame is supplied to a logical AND 
circuit 34. Also supplied to the logical AND circuit is a 
signature mask output from a signature mask unit 35. 
The signature mask includes a set of logical zeros (0's) 
35a and a set of logical ones (1's) 35b arranged with a 
20 priori knowledge of the watermark arrangement. The 
logical zeros 35a correspond to the positions of the non- 
critical bits in the compressed speech frame. Logically 
AND'ing the compressed speech frame bits with the sig- 
nature mask results in a compressed speech frame with 
25 the non-critical bits set to zero and the critical bits re- 
taining their value from the compressed speech frame. 
[0040] The compressed speech frame output from 
logical AND circuit 34 is input to a second buffer 36. The 
compressed speech frame stored in buffer 36 includes 
30 a first storage area 36a that includes the compressed 
speech frame non-critical bits that have been set to zero 
36a and the compressed speech frame critical bits 36b 
that make up the speech frame output from logical AND 
circuit 34. The speech frame stored in buffer 36 that in- 
35 dudes the zero-value non-critical bits is applied to a log- 
ical OR circuit 37. Also applied to the logical OR circuit 
37 are a set of source capability bits stored in a first area 
38a of a source capabilities buffer 38. Stored in a second 
area 38b of buffer 38 is a set of logical zeros that corre- 
40 spond to the positions of the compressed speech frame 
critical bits in the speech frame. The source capability 
bits stored in area 38a are set according to source ca- 
pability information concerning capabilities of the source 
radio. The capability information can include, for exam- 
45 pie, source vocoder types, source vocoder revision 
numbers, source ID, etc. The logical OR circuit 37 com- 
bines the source capabilities information from buffer 38 
with the speech frame recorded in buffer 36. The effect 
of that operation is to combine the source capability in- 
50 formation with the compressed speech frame non-criti- 
cal bits. The resulting output of the logical OR circuit is 
output to a watermarked speech frame buffer 39 that 
contains a watermarked speech frame having a set of 
source capability bits 39a and a set of compressed 
55 speech frame critical bits 39b. Because the source ca- 
pability bits are located in the non-critical bit positions 
of the compressed speech frame, applying the water- 
mark has little noticeable effect on the speech frame. 
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The watermark speech frame 39 is then output from the 
source unit 30 and transmitted to a destination radio. 
[0041] A destination radio 40 is shown in Fig. 5. The 
destination radio receives the watermark speech frame 
and stores it in a speech frame storage buffer 41. The 
received speech frame corresponds to the watermarked 
speech frame transmitted from source radio 30, and in- 
cludes a set of source capability bits stored in a source 
capabilities area 41 a and a set of compressed speech 
frame critical bits stored in a critical bit area 41 b. The 
destination radio uses the watermark speech frame to 
extract the source capability information and to extract 
the speech information from the critical bit area for re- 
producing the speech. A first logical AND circuit 42 ex- 
tracts the compressed speech frame critical bits from 
the watermarked speech frame. A data extraction mask, 
held in a data extraction mask buffer 43, includes a set 
of logical zeroes 43a at locations corresponding to lo- 
cations of the source capabilities information within the 
watermarked speech frame, and a set of logical ones 
43b at locations corresponding to locations of the com- 
pressed speech frame critical bits within the water- 
marked speech frame. The data extraction mask applies 
those logical zeroes and ones to the logical AND circuit 
42 which operates to output the compressed speech 
frame with the non-critical bits set to zero. Applying the 
logical ones 43b to the watermark speech frame causes 
the compressed speech frame critical bits to be output 
from the logical AND circuit 42 unaltered. 
[0042J The watermark speech frame 41 is also ap- 
plied to a second logical AND circuit 44. Also applied to 
logical AND circuit 44 is a signature extraction mask 
held in signature extraction buffer 45, which also in- 
cludes an area 45a for storing logical ones and an area 
45b for storing logical zeros. As with the data extraction 
mask described above, the logical AND circuit 44 ap- 
plies the signature extraction mask 45 to the water- 
marked speech frame, although it outputs the source ca- 
pability bits 41a unaltered and sets the critical bits 41b 
to zero. That is, the logical AND circuit 44 applies the 
logical ones in area 45a to the source capability bits 41 a 
in the watermarked speech frame and allows those 
source capability bits to pass unaltered. However, the 
logical AND circuit 44 applies the logical zeroes in area 
45b to the compressed speech frame critical bits 41 b of 
the watermark speech frame, thereby setting those crit- 
ical bits to zero. Hence, logical AND circuit 44 outputs 
source capability bits 41 a with the compressed speech 
frame critical bits 41 b set to zero. 
[0043] The source capability information output from 
logical AND circuit 44 is stored in a source capabilities 
signature unit 46 which includes a source capability sig- 
nature area 46a. Unit 46 can also include an area for 
storing the critical bits that were set to zero by logical 
AND circuit 44, although those bits need not be retained. 
The source capabilities unit 46 decodes the source ca- 
pabilities signature, and from that signature determines 
the source radio's capabilities and outputs information 



to that effect. The source capabilities information can 
include, for example, the vocoder-type, vocoder revision 
number, ID, etc. of the source radio. This capability in- 
formation is supplied to a negotiations processor 47 that 
5 negotiates communications, or other parameters with 
the source radio. The negotiations processor 47 uses 
the source capabilities information to determine the 
common capabilities between the source and the des- 
tination radios based on the source capability signature 
'0 and the capabilities information supplied by the destina- 
tion radio. The source capabilities unit 46 determines 
the highest level of source capability commonality be- 
tween the source and destination radios and uses that 
information to negotiate with the source radio. 

'5 [0044] A compressed speech buffer 48 stores the 
compressed speech frame non-critical bits set to zero 
48a and the compressed speech frame critical bits 48b. 
The speech frame in buffer 48 is passed to a speech 
decompression unit 49. The speech decompression unit 

20 receives from the source capabilities unit 46 capability 
information designating decompression parameters, 
such as the type of decompression to perform (e.g., 
MELP or LPC-1 Oe). The speech decompression unit 49 
uses that information for setting parameters for the 

25 speech decompression. The speech decompression 
unit 49 operates to decompress the speech frame based 
on the decompression parameters supplied from source 
capabilities unit 46, and outputs the raw speech signal 
50, again at the sampling rate of 22.5 milliseconds. 

30 [0045] In this manner the destination radio 40 can op- 
erate with the highest level of capabilities that are in 
common with the source radio in order to communicate 
with the source radio and to process the speech signal. 
[0046] A process for operating the wireless radios 

35 shown in Figs. 4 and 5 is illustrated in the flowcharts 
shown in Figs. 6A and 6B. Fig. 6A illustrates the opera- 
tions performed by the source wireless radio 30 in Fig. 
4. Here, an uncompressed multimedia data stream is 
compressed 51 using a compression algorithm availa- 

40 ble to the source radio. Capabilities of the source unit 
52 are determined and that capability information is 
used to generate a source capabilities signature 53. The 
compressed multimedia data stream signal is applied to 
a signature mask that masks the data stream non-criti- 

« cal bits for use in carrying the source capabilities infor- 
mation 54. The generated source capabilities signature 
is then applied as a watermark to the masked and com- 
pressed multimedia data stream 55. The watermarked 
multimedia data stream is then transmitted to a destina- 

50 tion wireless radio 56. The transmission is shown by way 
of connector A In Fig. 6A connecting to a similar point 
in Fig. 6B. 

[0047] The destination wireless radio receives the wa- 
termarked multimedia data stream 57, masks the da- 
55 tastream non-critical bits 58, and extracts a signature 
from the watermark 59. The extracted signature is used 
to recover the source capabilities signature from the da- 
ta stream 60. The source capabilities are determined 
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from the recovered source capabilities signature 61. 
Those source capabilities are compared 62 with capa- 
bilities of the destination radio 63. Based on the com- 
pared source and destination capabilities, the highest 
level of capabilities common to both the source and des- 5 
tination are determined 64 and capabilities information 
consistent with that determination is output for use in 
decompression and subsequent communication with 
the source radio. 

[0048] Once the signature is extracted from the wa- w 
termark in operation 59, the data stream is then recov- 
ered 65. The multimedia data stream is then decom- 
pressed using the determined common source and des- 
tination capabilities 66. Upon the decompression, the 
multimedia data stream is recovered 67 and available '5 
for speech reproduction. 

[0049] Although watermarks are described above in 
terms of communicating a plurality of attributes, alterna- 
tively, if desired, a single attribute only can be embedded 
in the multimedia data as a watermark for use in config- 20 
uring a destination radio. For example, the single at- 
tribute embedded as a watermark can be an indication 
that the source radio compressed the speech signal ac- 
cording to the MELP standard. 

25 

Applications 

[0050] The systems and methods described here can 
be applied to many applications, including, but not lim- 
ited to the following applications. 30 

• Military mobile, wireless communication equip- 
ment, including radios, secure terminals. 

• Multimedia over IP equipment including voice, vid- 35 
eo and data communications equipment. 

• Mobile networking multimedia equipment, including 
cell-phones, networked radios, network data and 
video terminals, PDA's, Digital Paging, Advanced *o 
HDTV. 

• Point-to-Point, broadcast, multicast, and confer- 
encing multimedia equipment. 

45 

• Non-wireless communication equipment, including 
internet, intranet and point-to-point where known 
ID, location or user capabilities is important. 

[0051] The methods, systems and apparatuses de- so 
scribed here can be used whenever two devices must 
communicate and offer varying service, accommodate 
different versions or provide flexible Interfacing. For ex- 
ample, in a mobile client/server environment these tech- 
niques can be used to synchronize varying versions of ss 
the clients with the server's resources. For instance, a 
PDA client might use the digital watermarking tech- 
niques described here to authenticate its ability to use 



a particular feature, allow for conversion of data or pro- 
vide upgraded services. Newer PDAs with more capa- 
ble software can gain access to better services/features 
than can older PDAs that do not have the more capable 
software. The digital watermarking techniques de- 
scribed here also can be used with other devices, such 
as cellular telephones to identify the telephone's ability 
to receive pages or e-mails. An example of such a use 
with telephones is where two telephones use the same 
telephone number. During a negotiation process the tel- 
ephones inform a base station of the services the tele- 
phones are capable of providing. One telephone might 
allow a particular service because that telephone is a 
newer model that supports newer features. However, 
another telephone might be an older telephone that is 
incapable of the supporting newer features. According- 
ly, each telephone informs the base station of its capa- 
bilities by using a digital watermark with information con- 
cerning the telephone's capabilities included in the wa- 
termark. The base station negotiates with each tele- 
phone individually, based on that telephone's capabili- 
ties indicated in the watermark, thereby allowing each 
telephone to use the features it has available and to op- 
erate with the highest level of capabilities that the tele- 
phone can support. 

[0052] Having described systems and methods for 
using a digital watermark to negotiate compatible capa- 
bilities, it is believed that other modifications, variations 
and changes will be suggested to those skilled in the art 
in view of the teachings set forth herein. It is therefore 
to be understood that all such variations, modifications 
and changes are believed to fall within the scope of the 
present invention as defined by the appended claims. 
Although specific terms are employed herein, they are 
used in their ordinary and accustomed manner only, un- 
less expressly defined differently herein, and not for pur- 
poses of limitation. 



Claims 

1. A method of communicating information concerning 
an attribute of a data source unit, comprising: 

generating (14) a watermark based on said at- 
tribute of the data source unit; 
combining (15) the watermark with a data 
stream (12) from the data source unit, thereby 
generating a data transmission unit; and 
transmitting (16) the data transmission unit to 
a destination unit. 

2. A method of determining capabilities of a data 
source unit, comprising: 

receiving a data transmission unit (1 6) contain- 
ing a data stream having a watermark, the wa- 
termark containing information concerning an 
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attribute of a data source unit; and 
determining (1 7), based on the watermark (1 6), 
said attribute of the data source unit. 

The method of claim 1 , further comprising: 5 

compressing the data stream (13, 51) accord- 
ing to a source compression algorithm, 
detecting a capability of the source unit (52); 
generating a signature (53) based on the de- 10 
tected capability of the source unit; and 
applying (55) the signature as the watermark to 
the compressed data stream to generate the 
data transmission unit 

15 

wherein the attribute upon which the watermark is 
generated identifies the source compression algo- 
rithm. 

The method of claim 2, wherein the transmission 20 
data unit is received at a destination unit, and the 
method further comprises 

extracting (59) a signature from the watermark, 
determining (61 ) asource unit attribute from the 25 
extracted signature, 

determining (63) a destination unit attribute cor- 
responding to the data source unit attribute, 
comparing (62) the source unit attribute with the 
destination unit attribute, determining (64) a ca- 30 
pability common to both the source and desti- 
nation units based on the compared attributes; 
and 

negotiating a parameter for use in communicat- 
ing between the data source unit and the des- 35 
tination unit based on the determined common 
capability. 

A data source apparatus (1), comprising: 

40 

a data stream processor (2) configured to out- 
put a data stream; 

a signature generator (3) configured to gener- 
ate a signature containing Information concern- 
ing at least one attribute of the data source ap- 45 
paratus; wherein said at least one attribute of 
the data source unit corresponds to at least one 
capability of the data source unit and 
a combiner (4) configured to receive data 
stream and signature, to embed the signature so 
as a watermark within the date stream, and to 
output a watermarked data unit. 

A data source apparatus (1) suitable for communi- 
cation with a destination unit (6), comprising: 55 

means (2) for generating a data stream; 
means (3) for generating a watermark based on 



a plurality of capabilities of the data source ap- 
paratus; 

means (4) for combining the watermark with the 
data stream, thereby generating a data trans- 
mission unit; and 

means (5) for transmitting the data transmis- 
sion unit to a destination unit(6). 

7. A destination apparatus (6), comprising: 

a reception unit (7) configured to receive a data 
transmission unit having multimedia data con- 
taining an embedded watermark, wherein the 
watermark contains information concerning at 
least one capability of a source data unit out- 
putting the multimedia data; 
a watermark detector (8) configured to detect 
the watermark embedded in the multimedia da- 
ta; and 

a capabilities unit (46) configured to extract 
source data unit capability information from the 
watermark and to control operation of the des- 
tination apparatus according to the extracted 
capability information. 

8. The destination apparatus of claim 7, further com- 
prising a capabilities negotiation processor (1 1 , 47) 
configured to negotiate with the source data unit 
communications parameters based on the capabil- 
ity information extracted from the watermark. 

9. The destination apparatus of claim 7, wherein mul- 
timedia data contained in the data transmission unit 
Is compressed according to a compression algo- 
rithm employed in the data source unit, and wherein 
the source data unit capability information extracted 
from the watermark includes information identifying 
said compression algorithm, the apparatus further 
comprising a multimedia data decompression unit 
(49) configured based on said information identify- 
ing said compression algorithm to decompress the 
multimedia data. 

10. An destination apparatus (6) for communicating 
with a data source unit, comprising: 

means (7) for receiving a data transmission unit 
containing a data stream having a watermark, 
the watermark containing information concern- 
ing a plurality of capabilities of a data source 
unit; and 

means (11, 44, 45,46) for determining, based 
on. the watermark, said plurality of capabilities 
of the data source unit. 



9 




10 



o 



EP 1 326 422 A2 



( START ) 



RECEIVE MULTIMEDIA DATA STREAM FROM A DATA SOURCE 



12 



PROCESS (COMPRESS/TRANSFORM) THE MULTIMEDIA DATA STREAM 



13 
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